Generating a genome assembly with PCAP.
نویسندگان
چکیده
This unit describes how to use the Parallel Contig Assembly Program (PCAP) to assemble the data produced by a whole-genome shotgun sequencing project. We present a basic protocol for using PCAP on a multiprocessor computer in a 300-Mb genome assembly project. A support protocol to prepare input files for PCAP is also described. Another basic protocol for using PCAP on a distributed cluster of computers in a 3-Gb genome assembly project is presented, in addition to suggestions for understanding results from PCAP.
منابع مشابه
PCAP: a whole-genome assembly program.
We describe a whole-genome assembly program named PCAP for processing tens of millions of reads. The PCAP program has several features to address efficiency and accuracy issues in assembly. Multiple processors are used to perform most time-consuming computations in assembly. A more sensitive method is used to avoid missing overlaps caused by sequencing errors. Repetitive regions of reads are de...
متن کاملApplication of a superword array in genome assembly
We introduce a data structure called a superword array for finding quickly matches between DNA sequences. The superword array possesses some desirable features of the lookup table and suffix array. We describe simple algorithms for constructing and using a superword array to find pairs of sequences that share a unique superword. The algorithms are implemented in a genome assembly program called...
متن کاملFosmid-based physical mapping of the Histoplasma capsulatum genome.
A fosmid library representing 10-fold coverage of the Histoplasma capsulatum G217B genome was used to construct a restriction-based physical map. The data obtained from three restriction endonuclease fingerprints, generated from each clone using BamHI, HindIII, and PstI endonucleases, were combined and used in FPC for automatic and manual contig assembly builds. Concomitantly, a whole-genome sh...
متن کاملPhysical map-assisted whole-genome shotgun sequence assemblies.
We describe a targeted approach to improve the contiguity of whole-genome shotgun sequence (WGS) assemblies at run-time, using information from Bacterial Artificial Chromosome (BAC)-based physical maps. Clone sizes and overlaps derived from clone fingerprints are used for the calculation of length constraints between any two BAC neighbors sharing 40% of their size. These constraints are used to...
متن کاملThe Child Abuse Potential Inventory and pregnancy outcome in expectant adolescent mothers.
OBJECTIVE The study explores the prenatal Child Abuse Potential (pCAP) scores derived from the Child Abuse Potential Inventory administered to expectant adolescent mothers. The aim of the study was to assess the association of the pCAP scores with maternal negative prenatal behaviors, and evaluate the contribution of the pCAP scores to neonatal morbidity. METHOD The pCAP scores, demographic d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Current protocols in bioinformatics
دوره Chapter 11 شماره
صفحات -
تاریخ انتشار 2005